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METHOD AND DEVICE FOR IMPROVING THE RELIABILITY OF A 
COMPUTER SYSTEM 

The present invention relates to computer 
systems. In particular, the invention concerns a 
5 method and device for improving the' reliarbility of a 
computer system. 

BACKGROUND OF THE INVENTION 

In computer systems, standardized bus solu- 
3 10 tions are used to interconnect different peripherals 

^ or processor systems. CompactPCI (PCI, Peripheral . Com- 

ponent Interconnect) is a bus solution based on the 
,S PCI bus, used especially in computer systems intended 

for industrial and/or embedded applications in me- 
p 15 chanically demanding environments. A more extensive 

ril description of the properties of the PCI bus is to be 

ri found in the publication "PCI Local Bus Specif ica- 

ll tion", PCI Special Interest Group, June 1, 1995. We 

m append the publication to the present application via 

2 0 this reference. 

Units connected to the bus communicate with 
each other by using a special addressing sequence. In 
certain addressing sequences, an addressing unit ad- 
dresses a unit to be addressed or waits for a response 
25 to the addressing until the addressed unit responds. 
If the unit addressed is defective, it is unable to 
respond to the addressing, in which case the entire 
computer or microcomputer system will remain waiting 
for the release of the address bus. This may result in 

3 0 an error situation in the entire system. An example of 

this type of situation arises in certain addressing 
modes of the CompactPCI bus in which the addressing 
sequence does not include any element for the monitor- 
ing of bus release. Under these conditions, a problem 
35 arises if the plug- in unit is defective, in which case 
it may, acting via an interface circuit, keep the DEV- 



wo 01/13231 



2 



PCT/FIOO/00689 



SEL# Signal of the CompactPCI bus active and the TRDY# 
signal inactive, thus indicating that it is aware of 
being addressed {DEVSEL#) but is not yet ready for ac- 
tion. The system controlling the PCI bus remains wait- 
5 ing for the release of the bus and the operation of 
the system is thus blocked up. 

Typically, the master unit of the system is 
provided with a so-called watchdog timer, which has to 
be reset at certain predetermined intervals. If the 
10 watchdog timer is not reset, then the system will re- 
boot, i.e. reset itself. This may result in an endless 
loop and a crash of the system. However, in the above 
example situation, the watchdog timer is not necessar- 
ily started at all, or it may not "notice" the prob- 
15 lematic situation that has arisen. 

The object of the present invention is to 
eliminate the problems described above or at least to 
significantly alleviate them. A further object of the 
invention is to disclose a new type of method and de- 
20 vice for disconnecting a defective unit from a com- 
puter system in the event of a fault- Another object 
of the invention is to improve the reliability of the 
PCI bus system by using a simple monitoring mechanism 
working internally in the device connected to the bus. 

25 

BRIEF DESCRIPTION OF THE INVENTION 

In the present invention, a plug- in unit con- 
nected to a PCI bus or an interface circuit acting as 
the interface between the bus and the plug-in unit is 

30 provided with a watchdog timer for internal monitoring 
of the addressing of the plug-in unit. This makes it 
possible to detect error situations that are not nec- 
essarily detected by mechanisms implemented in the 
master system controlling the PCI bus, thus allowing 

3 5 the consequent problems to be avoided. 

The invention concerns a method for improving 
the reliability of a computer system. The computer 
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system comprises a bus, preferably a CompactPCI bus. 
In addition, the system comprises an interface circuit 
and a plug -in unit which is connected to the bus via 
the interface circuit. In the method, the plug- in unit 
5 is addressed via the bus. Addressing refers to e.g. 
I/O and memory addressing directed at the plug- in 
unit. According to the invention, the duration of ad- 
dressing is monitored by the interface circuit, and if 
it exceeds a predetermined length of time, then the 

10 addressing is interrupted. The monitoring of address- 
ing can be implemented by providing the interface^ cir- 
cuit with a watchdog timer, which is activated and 
possibly initialized at the start of addressing of the 
plug-in unit. If the duration of addressing exceeds a 

15 time limit preset in the watchdog timer, then an abort 
of addressing is perforroed e.g. upon an initiative by 
the interface circuit. 

In a preferred embodiment of the method, a 
TARGET ABORT type addressing abort operation is per- 

2 0 formed by the interface circuit as described e.g. in 

the above-mentioned publication "PCI Local Bus Speci- 
fication", page 41. In this case, the interface cir- 
cuit performs the abort of addressing even if the 
plug- in unit should be out of order and unable to 
25 abort the addressing itself. 

In an embodiment, the SERR# signal is set 
into active state in the bus by the interface circuit 
after an abort of addressing. In consequence of these 
actions, the master unit controlling the bus can dis- 

3 0 connect the faulty plug- in unit from the bus. In a 

preferred embodiment of the invention, the Signaled 
System Error bit is set by the interface circuit into 
active state in the status register of the plug- in 
unit if the duration of addressing exceeds the time 
35 value preset in the watchdog timer. On the basis of 
the Signaled System Error bit, the plug- in unit de- 
tects that the interface circuit has generated an er- 
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ror message for the bus. Based on tliis, the plug- in 
unit can change its operational status, e.g. by indi- 
cating the error situation via a LED comprised in the 
plug- in unit or a corresponding signal denoting a 
5 fault. The above-mentioned signals are also described 
in the publication "PCI Local Bus Specification" re- 
ferred to above. 

The invention also concerns an interface cir- 
cuit for improving the reliability of a computer sys- 

10 tem as described above. According to the invention, 
the interface circuit comprises a watchdog tijner, 
means for starting timing upon the start of address- 
ing, and means for aborting the addressing if its du- 
ration exceeds a time value preset in the watchdog 

15 timer. In an embodiment of the invention, the bus is a 
CompactPCI bus- The interface circuit preferably com- 
prises means for terminating addressing. In addition, 
the interface circuit comprises means for setting the 
SERR# signal into active state in the bus if the dura- 

20 tion of addressing has exceeded the time limit preset 
in the watchdog timer or when the interface circuit 
has given to the bus a notice of termination of ad- 
dressing. Furthermore, the interface circuit may com- 
prise means for setting the Signaled System Error bit 

25 into active state in the status register of the plug- 
in unit if the duration of addressing has exceeded the 
time value preset in the watchdog timer or after the 
interface circuit has given to the bus a notice of 
termination of addressing. 

30 The invention provides the advantage of ena- 

bling the computer system to detect a defective plug- 
in unit and disable it without any actions on the 
user's part. Furthermore, the invention makes it pos- 
sible to avoid an error situation affecting the entire 

35 system as the faulty unit incapable of responding to 
addressing will not occupy system resources. At the 
same time, a system diagnostics arrangement tells the 
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serviceman which one of the plug- in units of the sys- 
tem needs repairing. The watchdog timer can be easily 
and economically implemented on the interface circuit, 
so the invention is applicable in many different envi- 
5 ronments . _ 

LIST OF ILLUSTRATIONS 

In the following, the invention will be de- 
scribed by the aid of a few examples of its embodi- 
10 ments with reference to the attached drawing, wherein 

Fig. la and lb present diagrams representing 
an embodiment of the system of the invention; 

Fig. 2 presents a diagram giving a general 
illustration of the signalling associated with a Com- 
15 pactPCI bus application according to the invention; 
and 

Fig. 3 presents an embodiment of the method 
of the invention in the form of a flow diagram. 

20 DETAILED DESCRIPTION OF THE INVENTION 

Fig. la presents a diagram representing a 
system according to the invention. The system com- 
prises a bus PCI, which in the case of this example is 
a CompactPCI bus. Connected to the CompactPCI bus are 

25 a plurality of plug-in units 2^, 2^, 2^ using interface 
circuits 1. A plug-in unit 2 comprised in the system 
may be e.g. a bus master unit 2^, of which there may 
be one or more. The plug-in unit 2 may also be a slave 
unit 2^, an embedded system or an auxiliary device 2 

30 enhancing the properties of the system. An example of 
the computer system is the DX2 00 telephone switching 
system manufactured by Nokia, in which the plug- in 
units are connected to a CompactPCI bus. 

The components to be connected to the Com- 

35 pactPCI bus are subject to certain requirements re- 
garding their operation. These requirements describe 
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the signalling used in the PCI bus and the operation 
of the bus after the receipt or transmission of dif- 
ferent signals. These requirements will not be de- 
scribed in detail in this context; instead, reference 
5 is made to the above-mentioned publicatiori- "PCI Local 
Bus Specification" , which gives a detailed description 
of said requirements. 

In the example, the interface circuit 1 is 
implemented as a separate component connected to the 

10 plug- in unit 2, but it may also be implemented as a 
part of the plug-in unit. The interface circuit 1. is 
e.g. a functional entity implemented using a FPGA cir- 
cuit (FPGA, Field Programmable Gate Array) , in -which 
case certain functions of the plug- in unit 2 as well 

15 can be implemented in the same circuit. Corresponding 
functions can also be achieved using discrete compo- 
nents or an ASIC circuit (ASIC, Application Specific 
Integrated Circuit) . 

The interface circuit 1 comprises the re- 

20 quired components and program blocks - reference is 
made to the above-mentioned publication - for imple- 
menting the communication between the plug- in unit 2 
and the bus PCI, so the interface circuit acts as a 
link between the plug-in unit 2 and the bus functions. 

25 According to the invention, the interface circuit 1 
comprises a watchdog timer 3 (WDT) , which monitors the 
execution times of addressing operations in ^the bus 
PCI and initiates actions for the indication and 
elimination of the error situation if the execution 

30 time exceeds a predetermined time limit. 

Fig. lb presents a diagram representing an 
embodiment of the interface circuit 1 . In addition to 
the watchdog timer 3, the interface circuit comprises 
means 4 for activating the watchdog timer upon the 

35 start of addressing directed at the plug- in unit 2. In 
practice, these means 4 are implemented in conjunction 
with the signalling part of the interface circuit by 
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using a software block which starts timing upon de- 
tecting a given signal or signals in an active state. 
Other methods known to the skilled person may also be 
used to implement the said means 4 . The software block 
5 4 also identifies the address of the plug-in unit 2 
connected to the interface circuit 1. This ensures 
that only the watchdog timer of the right plug- in unit 
2 will be started, which means that plug- in units less 
frequently addressed will not produce any unnecessary 

10 fault signals. 

TARGET ABORT type termination of addressing 
is implemented using a given software section or block 
5, in which the aborting function is triggered by the 
status of the watchdog timer 3. In practice, the trig- 

15 gering factor is timer overflow. In this case, the 
functionality of the interface circuit 1 may save the 
entire system from crashing even if the plug- in unit 
should be defective. TARGET ABORT type termination of 
addressing refers to abnormal termination of address- 

20 ing in a situation where the addressed plug- in unit 
(target) detects a fatal malfunction or is unable to 
execute a request addressed to it . The interface cir- 
cuit 1 also comprises means 6 for setting the SERR# 
signal into active state in the bus if the duration of 

25 addressing has exceeded the time value preset in the 
watchdog timer 3. In practice, this, too, is a fiinc- 
tional property of the interface circuit and these 
means 6 can be implemented using a suitable program or 
program block. The SERR# signal is used in the system 

30 to report errors that result in serious malfunctions 
of the system. Further, using means 7, the Signaled 
System Error bit is set into active state in the 
status register STATUS of the plug- in unit 2 if the 
duration of addressing has exceeded the time value 

35 preset in the watchdog timer, i.e. if addressing has 
been interrupted. This, too, is a functional property 
of the interface circuit and means 7 can be imple- 
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merited using a suitable program or program block. Let 
it additionally be stated that the watchdog timer 3 
and the means 4-7 provided in the interface circuit 
can be implemented in an FPGA circuit or using dis- 
5 Crete components. The function of the invention may be 
implemented in all of the interface circuits 1 or in 
only some of them. 

Fig. 2 presents an example giving a more de- 
tailed representation of the components shown in Fig. 

10 la and lb. Fig. 2 illustrates the components and sig- 
nalling in the interface circuit of a plug- in unit or 
interface unit at block diagram level. The interface 
circuit is connected to a CompactPCI bus (CompactPCI 
BUS) . The operation and function of these components 

15 are obvious to the skilled person and therefore we 
shall not describe them in detail except for parts 
that are significant in respect of the invention. The 
watchdog timer (Fig. 2) is started when an addressing 
sequence in the PCI bus begins, i.e. when a plug -in 

20 unit behind an interface circuit is addressed via the 
bus. The interface circuit detects the addressing e.g. 
by an active IDSEL signal indicating the selection of 
a plug -in unit- 

Before addressing is started, the PCI bus has 

25 to request access to the internal bus of the plug- in 
unit by setting the PCI_BREQ signal into active state. 
To the watchdog timer, this is an indicatioia of ad- 
dressing being started, and it is started. When the 
User Interface logic hands over the internal bus to 

30 the PCI bus, the PCI_BGNTn signal is set into active 
state and at the same time the watchdog timer is ad- 
vised to stop covmting. This action resets the watch- 
dog timer. Having gained access to the internal bus, 
the PCI bus starts a write or read cycle by setting 

35 the PCI_WRITE or PCI_READ signal into active state. 

If the PCI circuit is in WAIT state, in which 
case each bus cycle has to be acknowledged with a 
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READY signal, then the watchdog timer is started. In 
this case, the watchdog timer is stopped when the 
READYn signal is active, the PCI_WRITE or PCI_READ 
signal being thereby deactivated, which in practice 
5 means that the operation has been complet.ed. The PCI 
circuit is in WAIT state when software or FPGA circuit 
code is being loaded into the unit in question or when 
the unit requires that all addressing operations be 
perfoiroed on a WAIT basis. When the PCI circuit is not 

10 in WAIT state, the watchdog timer does not monitor in- 
dividual addressing operations; instead, it only moni- 
tors allocation requests for the unit's internal' bus 
on the basis of the PCI_BREQn and PCI_BGNTn signals. 

If the time limit set in the watchdog timer 

15 is reached while the PCI circuit is in WAIT state, 
then the following actions will be performed: 

addressing is aborted or the PCI bus cycle is in- 
terrupted, 

the control signal applied to the bus of the iinit 
20 is deactivated, 

the internal status engines of the PCI circuit are 
initialized, 

the SIGNALED SYSTEM ERROR bit in the STATUS regis- 
ter is set, 

25 the SIGNALED TARGET ABORT bit in the STATUS regis- 

ter is set, 

the SERR# signal for the CompactPCI bus is, acti- 
vated, whereupon the circuit returns to initial state 
to wait for new addressing. 

30 In practice, the software in the master computer con- 
trolling the CompactPCI bus further disables the de- 
fective plug- in unit, and the party responsible for 
system maintenance is informed about the fault. 

Fig. 3 presents a flow diagram giving the 

35 steps comprised in a method according to the inven- 
tion. In step 10, a plug- in unit 1 is addressed from 
the bus PCI. The addressing may be I/O type addressing 
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or memory addressing. The addressing device may t»e 
e.g. a bus master unit 2^. In step 11, the interface 
circuit 1 detects the addressing and starts the watch- 
dog timer 3. In steps 12 and 13, a check is carried 
5 out to establish the state of the addressing in rela- 
tion to the watchdog timer 3 . If the addressing is 
terminated before an overflow occurs in the watchdog 
timer 3 , then the timer is stopped and left in an in- 
active state, waiting for the next occurrence of ad- 

10 dressing. If an overflow occurs in the watchdog timer 
3, then the procedure goes on to step 14. In step^ 14, 
a Target Abort type termination of addressing is ■ per- 
formed. In step 15, the SERR# signal is set into ac- 
tive state in the bus PCI. In step 16, the Signaled 

15 System Error bit is set into active state in the 
status register of the plug-in unit 2. 

The invention is not restricted to the exam- 
ples of its embodiments described above; instead, many 
variations are possible within the scope of the inven- 

20 tive idea defined in the claims. 



